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Superposition Coding is Almost Always Optimal 
for the Poisson Broadcast Channel 

Hyeji Kim*, Benjamin Nachman^ and Abbas El Gamal* 


Abstract 

This paper shows that the capacity region of the continuous-time Poisson broadcast channel is achieved via superposition 
coding for most channel parameter values. Interestingly, the channel in some subset of these parameter values does not belong 
to any of the existing classes of broadcast channels for which superposition coding is optimal (e.g., degraded, less noisy, more 
capable). In particular, we introduce the notion of effectively less noisy broadcast channel and show that it implies less noisy but 
is not in general implied by more capable. For the rest of the channel parameter values, we show that there is a gap between 
Marton’s inner bound and the UV outer bound. 


I. Introduction 


The continuous-time Poisson channel is a canonical model of the point to point optical communication channel in the low 
power regime [1-3]. The capacity of this channel was established using different approaches by Kabanov [4], Davis [5], and 
Wyner [6, 7]. In particular, Wyner [6, 7] established the capacity using an elementary method in which the capacity is shown 
to be the the limit of the capacity of a certain memoryless binary channel. Wyner’s approach spurred several generalizations 
to multiple user Poisson channels. In [8] Lapidoth and Shamai established the capacity region of the Poisson multiple-access 
channel. In [9], Lai, Liang, and Shamai studied the Poisson interference channel. In [10], Bross, Lapidoth, and Shamai studied 
the Poisson channel with side information at the transmitter. In [11], Lapidoth, Telatar, and Erbanke studied the Poisson 
broadcast channel and established the condition under which the channel is degraded; hence the capacity region is achieved 
using superposition coding [12]. 

In this paper, which is an expanded and a more complete version of [13], we show that for the Poisson broadcast channel, 
superposition coding is optimal much beyond the parameter ranges for which the channel is degraded. We consider the 2- 
receiver continuous-time Poisson broadcast channel (P-BC) depicted in Figure 1. The channel input signal X{t) € [0,1], f > 0, 
that is, we assume a peak power constraint of 1 on 2f(f).Given X{t) = x{t), the output Yi{t) is a Poisson process (PP) with 
instantaneous rate Ai{x{t) + Si) for z = 1, 2, i.e., for {)<w<w + t<T, 


where 


P{Yi{w + t) - Yi{w) = k\X{t) = x{t), t e [o,r]} 


r'=(r(;,T) 

k\ 


g-r(u),T)^ 


fc G N, 


T{w,t) = 


pW-\-T 


Ai{x{t) + Si) dt. 


The parameter Ai is the channel gain for receiver z = 1, 2. The parameter Si > 0 is the rate of the input referred dark noise 
for receiver i. 

We consider the private message setting in which the sender X wishes to communicate a message Mi to receiver Tj at 
rate i?i and a message M 2 to receiver Y 2 at rate i? 2 , where Mi and M 2 are independent and uniformly distributed over 
[1 : 2”^i] X [1 : 2”^^]. The results we establish on the optimality of superposition coding can be readily extended to the case 
with common message [14]. We define a (2”^E 2"^^, rz) code, achievability, and the capacity region for this setting in the 
standard way [14]. 

As in [11], we use Wyner’s approach for the point-to-point Poisson channel to study the capacity region of the Poisson 
broadcast channel. As depicted in Figure 2-(a), time is quantized into intervals of length A > 0 and the continuous time P-BC 
is approximated in each interval by the binary memoryless broadcast channel (binary P-BC) depicted in Figure 2-(b), with 
transition probabilities: 


Oi — AiSiA -f O(A^), 02 — A 2 S 2 X + O(A^), 

bi = Aiil + si)A + 0(A2), 62 = 412(1 + S 2 )A + 0(A2). 


Following Wyner’s arguments, Lapidoth, Telatar, and Urbanke showed that the capacity region of the P-BC is equal to the 
capacity region of the 1/A-extension of this binary channel as A tends to zero. 
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Fig. 1: Two receiver continuous time Poisson broadcast channel. 



Fig. 2: (a) Time quantization, (b) Binary P-BC. 


In this paper, we show the surprising fact that superposition coding is optimal for almost all channel parameter values. We 
find explicit analytical expressions for the ranges of parameter values in which the channel is less noisy and more capable [15]. 
We then introduce the new class of effectively less noisy broadcast channels for which superposition coding is optimal and 
show that it includes the less noisy class but is not strictly included in the more capable class. The key idea is that the less 
noisy condition needs to hold only for channel input distributions that attain the maximum weighted sum rate. We find explicit 
analytical expressions for the ranges of parameter values for which the P-BC is effectively less noisy. By further strengthening 
the effectively less noisy condition, we show numerically that superposition coding can be optimal even when the channel 
is not more capable or effectively less noisy. Finally we show that for the remaining set of parameter values, there is a gap 
between Marton’s inner bound [16] and the UV outer bound [17]. 

The rest of the paper is organized as follows. In Section II we review superposition coding and the optimal superposition 
coding inner bound for binary input broadcast channels, and summarize known classes of broadcast channels for which 
superposition coding inner bound is tight. In Section III, we establish the parameter ranges for which a P-BC is less noisy and 
more capable. In Section IV, we introduce the new class of effectively less noisy broadcast channels for which superposition 
coding is also optimal, and obtain the parameter range for which a P-BC is effectively less noisy. In Section V, we show 
that for certain range of parameter values, there is a gap between Marton’s inner bound and the UV outer bound. Hence, the 
capacity region of the P-BC is still not known in general. In Section VI, we extend our results to the average power constraint 
case. Finally in Section VII, we remark on the optimality of superposition coding for general binary input broadcast channels. 
We demonstrate via an example that our intuition about when superposition coding is optimal for the broadcast channel can 
be quite misleading. 


II. Superposition coding inner bound 

Consider a 2-receiver discrete memoryless broadcast channel p{yi,y 2 \x). The superposition coding scheme [12] is motivated 
by broadcast channels for which one receiver is “stronger” than the other. This suggests a layered coding approach in which 
the weaker receiver (say I 2 ) recovers only its own message M 2 carried by the auxiliary random variable U, while the stronger 
receiver (Ti) recovers both messages {Mi, M 2 ) carried by X. This coding scheme leads to the inner bound on the capacity 
region of the general discrete memoryless broadcast (DM-BC) channel p{yi,y 2 \x) that consists of all rate pairs (i?i, R 2 ) such 
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that [18] 

Ri<I{X;Yi\U), 

R2<I{U;Y2), (2) 

i?i +i?2 < I{X;Yi) 

for some pmf p{u,x), and \U\ < \X\ + 1. 

Let IRi denote the region (2). Note that a second superposition coding inner bound can be readily obtained by exchanging 
Yi and Yz and Ri and Rz in (2). Denoting the second region by Jiz, it can be shown that if the capacity of the channel 
p{yi\x), Cl, is larger than the capacity of the channel p{yz\x), Cz, then Jlz C {{RijRz): RijCi + RzjCz < !}• 

Remark 1. In [19], Wang, Sasoglu, Bandemer, and Kim compared two superposition encoding schemes for the broadcast 
channel. The first is the UX scheme above, and the second is the ([/, V) scheme in which Mi is carried by V, Mz is carried 
by U, and is a function of (U, V). The weaker receiver y 2 again recovers M 2 carried by U and the stronger receiver Yi 
recovers Mi carried by V. They showed that the optimal inner bound achieved by the UV scheme can be strictly larger than 
that achieved by the UX scheme. It turns out, however, that if the broadcast channel has binary input, the optimal inner bound 
(assuming Ci > Cz) is co (5ii U {(0, Cz)}) [20]. Since in this paper we are concerned with the binary P-BC, which has binary 
input and outputs, we focus only on conditions under which either Jli or JI 2 is optimal. 

It is well known that region Iki is tight for the following classes of DM-BC. 

Definition 1 (Degraded broadcast channel [12]). For a DM-BC p(yi,y 2 lx) Y 2 is said to be a degraded version of Yi if there 
exists a random variable Y} such that Yl\{X = x} ~ pYi\x{y'i\x), i.e., Y} has the same conditional pmf as Yi (given X) and 
X ^ Y( ^ Y 2 form a Markov chain. 

Definition 2 (Less noisy channel [15]). For a DM-BC p{yi,y 2 \x) receiver Yi is said to be less noisy than receiver Y 2 if 
HU; Yi) > HU; Y 2 ) for all p{u, x). 

Van-Dijk [21] showed that receiver Yi is less noisy than receiver Y 2 if /(2f; Yi)—/(2f; Y 2 ) is concave in p{x), or equivalently, 
I{X;Yi) — /(2f; Y 2 ) is equal to its upper concave envelope Yi) — I{X;Y 2 )] (the smallest concave function that is 

greater than or equal to I{X;Yi) — I{X; Yz)). As we will see, this alternative condition of less noisy is significantly simpler 
to compute than the original condition. 

Definition 3 (More capable channel [15]). For a DM-BC p{yi,yz\x) receiver Yl is said to be more capable than receiver Y 2 
if /(X; Yl) > HX;Yz) for all p{x). 

The more capable condition can also be recast in terms of the concave envelope; Receiver Yi is more capable than Y 2 if 
Y 2 ) — /(AT; Yl)] = 0 for every p{x). 

It is also well known that degraded implies less noisy which implies more capable [15], but the converses do not always 
hold. In [22], Nair generalized the notions of less noisy and more capable. Let Vo be a class of pmfs p{u,v,x) such that for 
any triple of random variables {U, V,X) ^ p{u, v, x), there exists a pmf q{u, v, x) such that 

q{x) G Vo, 

HV;Yi)p < HV;Yi)„ 

HU;Yz)p<HU;Yz)q, (3) 

HV; Yi)p + HX; Y2\V)p < HV; Yi), + /(X; Y2 j Y),, 

HU; Y2)p + HX; Yl I U)p < HU; Yz)^ + HX; Yl I U)q. 

The notation I{V ; Yi)p refers to the mutual information between V and Yi when the input is generated according to piu, v, x). 

Definition 4 (Essentially less noisy [22]). For a DM-BC p{yi,yz\x), Yi is said to be essentially less noisy than Y 2 if there exists 
a sufficient class of pmfs Vo as defined in (3) such that /((7; Yi) > I{U; Y 2 ) for every p{x) G Vo and all (7 X ^ (Yi, Y 2 ). 

Definition 5 (Essentially more capable [22]). Eor a DM-BC p{yi,yz\x), Yi is said to be essentially more capable than Y 2 if 
there exists a sufficient class of pmfs Vo as defined in (3) such that I{X\Yi\U) > I{X-,Y 2 \U) for every p{x) G Vo and all 
U ^ X ^ {Yi,Y2). 

It can be easily seen from the definitions that less noisy implies essentially less noisy and more capable implies essentially 
more capable. However the converses do not always hold. Also it is shown in [22] that essentially less noisy neither implies 
nor is implied by essentially more capable. The capacity region for the essentially less noisy and the essentially more capable 
classes [23] is region 3?i as defined in (2). Also note that when the channel is essentially less noisy, the sum bound in (2) is 
always inactive. 
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III. Less noisy and more capable P-BC 


We evaluate the conditions for the less noisy and more capable stated in the previous section for the P-BC and show that 
the P-BC is less noisy for almost all channel parameter values. We assume without loss of generality that si < S 2 . The results 
for Si > S 2 can be similarly established. Also assume that Ai = a and A 2 = 1. The result for A 2 ^ 1 can be established 
using the fact that the capacity for (^ 1 ,^ 2 ) = ( 01 , 02 ) is equal to 02 times the capacity region for (^ 1 ,^ 2 ) = ( 01 / 02 , 1 ). 

In [11], Lapidoth et al. showed that >2 is a degraded version of Ti if a > 1. This can be seen either by directly inspecting 
the P-BC channel model or its binary P-BC counterpart. 

We now determine the conditions under which the P-BC is less noisy and more capable. 

We use the binary P-BC with the parameters given in (1) to extend the definitions of less noisy and more capable to the 
P-BC. Let 

I,(q) = lim X ~ Bern(g), t = 1,2, 

A-J-O A 

where X and VA are the input and outputs of binary P-BC as depicted in Figure 2-(b). Using Wyner’s results [ 6 , 7], it 
immediately follows that 

Ii(q) =a(- (q + Si) log((? -f Si) -f < 7(1 -f Si) log(l -f Si) -f (1 - q)si log(si)), 

-^ 2 ( 7 ) = -(q + S 2 ) log(q + S 2 ) + q(l + S 2 ) log(l -f S 2 ) + (1 - q)s 2 log(s 2 ), 
and that -fi(q) is maximized at 


qi = 


(1 + Si) 


l + Si 


- S. 


es: 


(4) 


(5) 


for 1 = 1,2. 

We define the less noisy condition for P-BC as follows. 

Definition 6. For the 2-receiver P-BC, receiver Yi is less noisy than Y 2 if Ii{q) — hiq) is concave in g € [0,1], i.e., if 
£[/i(g) - hiq)] = Ii{q) - hiq)- 

Similarly we define the more capable condition for P-BC as follows. 

Definition 7. For the 2-receiver P-BC, receiver Yi is more capable than Y 2 if Ii{q) — hiq) > 0 for every q € [0,1], i.e., if 
€[l 2 {q) - hiq)] = 0. 

To establish the parameter ranges for less noisy and more capable P-BC, we define the following breakpoints of a: 

1 Si 


OCl — 


0-2 — 


as = 


1 + 52 

S 2 l 0 g(l + I/S 2 ) - 1 
Sil0g(l -f 1/si) - 1’ 

(1 -f S2)l0g(l + I/S 2 ) - 1 
(1 -f si)log(l -f 1/si) - 1’ 


51 

a4 = —. 

5 2 

We also need the following lemma which characterizes the upper concave envelope of {Ii{q) — l2{q)) and {l2{q) — h{q))- 

Lemma 1. Consider a P-BC. 

1. Let 

0 if 0 < a < as, 

( 6 ) 


where 


9i{x) = 


f = ■^ ^(a) if as < a < ai, 

1 if a > ai, 

(1 -f S2) log ((1 -f S2)/(x -f S2)) - 1 + : 


(1 -f Si) log ((1 -f Si)/(x -f Si)) -1 + x 
Then, the upper concave envelope of {Ii{q) — l2{q)) is 


( 7 ) 


(L[h{q) - l2{q)\ = 


hiq) - hiq) for 0 < <7 < f, 

{l-q){Ii{t)-l 2 {t))/{l-t) fov q>t. 
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2. Let 


0 


<92 \a) 


1 


if 0 < a < a4, 
if Q!4 < Ct < 0(2, 

li a > a 2 , 


where 

, ^ _ 32 l0g(l +X/S2) -X 
^ Si log(l + x/si) - x' 
Then, the upper concave envelope of {I 2 W) — Ii{q)) is 


€[l2{q)-Ii{q)] 


{q/r){l 2 {r) - Ii{r)) for 0 < q < r, 
h{q)-h{q) for q>r. 


( 8 ) 


(9) 


The proof of Lemma 1 is in Appendix A. Figure 3 -(a) plots t vs a. The shaded area is where £[/i (q) — l2(q)] = /i(g) —/2(g). 
Figure 3 -(b) plots r vs a. The shaded region is where C[l2{q) — Ii{q)] = ^2(9) — h{q)- Figure 4 plots Ii{q) — l2{q) and 
€[Ii{q) — hiq)] for a = O.Saa + 0.7gi{q2) and a = 51(92) for (si,S2) = (0.1,1). Figure 5 plots 12(5) — l\{q) and 
^[hiq) — .^1(5)] for a = 0 . 3 a 4 + 0.752(51) and a = 52(51) for (si, S2) = (0.1,1). 




Fig. 3: (a) Plot of t vs o for si = 0.1, S 2 = 1- The shaded area is where £[ 71 ( 5 ) — l 2 {q)] = Il{q) — he., 0 < 5 < t. (b) Plot of r vs a for 

51 = 0.1, 52 = 1. The shaded area is where €[l 2 {q) — 7i(5)] = 72(g) — Ti(g), i.e., r < g < 1. 




Fig. 4: Plots of Ii{q) — 72(g) (solid) and £[7i(g) — 72(g)] (dashed line) vs g for (a) a = O.Sas + 0.7gi(g2) and (b) a = gi(g 2 ) (5i = 0.1,52 = 1). 


We are now ready to state the conditions for less noisy and more capable. 


Theorem 1. For si < 52 : 
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(a) (b) 

Fig. 5: Plots of / 2 (g) — Il{g) (solid) and £[ 72 (<?) — ^ 1 ( 9 )] (dashed line) vs q for (a) a = 0 . 3 a 4 + 0.7g2(qi) and (b) a = 92 ( 91 ) (si = 0.1,S 2 = 1). 


1. If a > ai, Fi is less noisy than Y2 and the capacity region is the set of rate pairs {Ri, R2) such that 

Ri < Phipo), 

R2 < hiPpo + P) - Phijpq) 

for some 0 < /3,po < 1- 

2. If a 2 < o: < cii, Yi is more capable than Y 2 and the capacity region is the set of rate pairs (i?i, R 2 ) such that 

Ri < PoIiiPo) + PiliiPi) + P2ll{P2), 

R2 < hip) — PohiPo) — Pihipi) — P2l2iP2), (11) 

Ri + R2 < h (p) 

for some 0 < Po, Pi, P 2 ,Po,Pi,P 2 < 1, where /3o + /3i + /32 = 1 and p = PoPo + PiPi + P 2 P 2 - 

3. If 0 < a < 0:4, Y 2 is less noisy than Yi and the capacity region is the set of rate pairs (i?i, R 2 ) such that 

^1 < hiPpo) - PhiPo), 

R 2 < PhiPo) 

for some 0 < P,po < 1- 

4. If a 4 < a < as, Y 2 is more capable than Yi and the capacity region is the set of rate pairs such that 

< h(p) - Poh(Po) - Pih(pi) - P 2 h{p 2 ), 

R2 < PohiPo) + Pihipi) + P2lliP2), 

Ri + R2 < I2 (p) 


for some 0 < Po, Pi, P2,Po,Pi,P2 < h where /3o + /3i + /32 = 1 and p = PoPo + PiPi + P2P2- 

The capacity region for Yi less noisy than Y 2 is the superposition rate region Jli without the sum rate bound for U ~ Bern(/3) 
and p{x\u) is the Z-channel shown in Figure 6-(a) where P,po € [0,1]; and the capacity region for Y 2 less noisy than Yi is 
(R 2 for U ~ Bern(/3) and p{x\u) is the Z-channel shown in Figure 6-(b) where P,po € [0,1]. 


Po 



Po 



Fig. 6: (a) p(x\u) for Yi less noisy than Y 2 . (b) p{x\u) for Y 2 less noisy than Yi. 
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The capacity region for Yi more capable than I 2 is the superposition rate region with U G {0,1, 2}, pu{j) = Pj, and 
XliU = 7 } ~ Bernfp,), 7 = 0,1, 2; and the capacity region for Yq more capable than Yi is the superposition rate region i 3^2 
with puU) = Pj and X\{U = j} ~ Bern(pj) for j = 0,1, 2. 

Figure 7 illustrates the ranges of a for which the P-BC is degraded, less noisy and more capable. From Theorem 1 the 
P-BC is less noisy if a < 04 or a > ai and more capable if a < q ;3 or a > a 2 - 



less noisy 

more capable 

less noisy 



degraded 









0:4 Ck3 0-2 O-l 1 

Fig. 7: Illustration of the ranges of a for which the channel is degraded, less noisy, and more capable as stated in Theorem 1. 


Proof of Theorem 1: 

We first find the condition on a for one receiver being less noisy or more capable than the other receiver. 

1. By part 1 of Lemma 1, C — ^ 2 ( 7 ) if and only if a > ai. Thus Yi is less noisy than I 2 iff a> ai. 

2. By part 2 of Lemma 1, €\l 2 {q) — ri(( 7 )] = 0 if and only if a > q; 2 - Thus Yi is more capable than Y 2 iff a > q; 2 - 

3. By part 2 of Lemma 1, €[l 2 {q) — ^ 1 ( 7 )] = l 2 {q) — Ii{q) if and only if 0 < a < 04 . Thus I 2 is less noisy than Yi iff 
0 < a < 04 . 

4. By part 1 of Lemma 1, C[Ii{q) —l 2 {q)] = 0 if and only if 0 < a < a^. Thus Y 2 is more capable than Yi iff 0 < a < a^. 

We now obtain the capacity region for P-BC such that Yi is more capable than Y 2 (part 2) and for P-BC such that Yi is 
less noisy than Y 2 (part 1). By exchanging Yi and Y 2 and /i(-) and l 2 {-), the capacity expression for part 4 and part 3 of 
Theorem 1 can be obtained similarly. 

For the 1/A-extension binary P-BC shown in Figure 2-(b), superposition coding inner bound is the set of rate pairs (i?i, R 2 ) 
such that 


i?i < {l/X)I{X-Y^^\U), 

R2<{1/X)I{U-Y^), (13) 

Ri+R 2< (l/X)I{X-Y^^) 


for some pmf p(u,x), and \U\ < \X\ -f 1. 

Let U € {0,1, 2} where puii) = Pi and px\uiM'i) = Pi G [0,1]. As A —>■ 0, the region in (13) is equivalent to the region 
in (11) in Theorem 2. If a P-BC is more capable, the region in (11) is indeed the capacity region. This is because the UV outer 
bound for 1/A-extension binary P-BC as A —s- 0 reduces to the inner bound in (11) under the condition for more capable 
P-BC. 

For part 1 of Theorem 1, note that the capacity region is the region in (11) with an inactive sum bound, i.e., the set of rate 
pairs (i?i,i? 2 ) that satisfy 


Rl< 

ie{0,1.2} 

R2<l2( X! “ X! 

^iG{0,l,2} ^ ie{0,l,2} 


(14) 


for some Pi,pi G [0,1] such that X]iG{o 12 }/^* = 1- Let 3?' and Ji" denote the region in (10) and (14) respectively. Note 
that IR' is the set of rate pairs (i?i,i? 2 ) that satisfy inequalities in (14) for (Po,Pi,P 2 ) = (P,P,0) for some P G [0,1] 
and po G [0,1] and pi = 1. Thus Ji' C 3?". We now show that every supporting hyperplane of 3?" intersects 3?', i.e.. 





max(fl.j + -^ 2 ) < inax(^j +?’ 2 )- Consider 


max (Ai?i + R2) 


max f ^ PiiXhiPi) - hijpi)) + h{ ^ ftp*)') 

..p.e[o,i] Vjg^Q ;^ 2} ie{o,i,2} ^ 

max ( max ( ; 3 i(A/i(pi) -/2(pi))) +/2(p) 

>e[o,i] Vft.Pi:EftPi=P V ' 'V 


iG{0,1.2} 


= max ((L[\Ii{p) - hip)] + hip)] 
pe[o,i] \ / 

= maxr max (A/i(p) -/2(p) +/2(p)), max [)! - p)i\Iiit') - hit'))/il 


< max (/ 3 (A/i(po) - hiPo)) + hiPpo + P)) 
/3,Po6[0.1] 

= max (Ari+r2). 

()'l,r-2)eK' 



Step (a) holds by Lemma 1. To prove step (&), note that the two terms in (15) are obtained by letting (/3,po) = (l,p) and 
iP,Po) = ((1 - P)/(1 - t'), t') in (16). 

We now show that Ji' is convex. Suppose Ji' is not convex, i.e., co{5i'} ^ Ji' where co{5J'} denotes the convex hull of 
Ji'. There exists a rate pair (ri,r 2 ) € co{3^'} on the boundary of co{3^'}, i.e., pri + r 2 = max(/jj /j 2 )Gco{ 3 t'}(M.Ri + R 2 ) 
for some /i > 0 such that (ri,r 2 ) ^ 3^'. Note that (ri,r 2 ) = ? 7 (rio,r 2 o) + (1 — ?' 2 i) for some 0 < 77 < 1 and 

irio,r 2 o), (rii,r 2 i) € 3^'. Since max(flj_^ 2 )eco{ 5 :'}(Ai-Ri + R 2 ) = max(flj_^ 2 )e 5 :'(Mf?i + R 2 ), the two rate pairs (rio,r 2 o) 
and (rii,r 2 i) satisfy 


prio + r2o = prii + r2i = max ipRi + R2). ( 17 ) 

We now show that the equality ( 17 ) cannot hold for (rio,r2o) 7^ (?'ii5f2i), i-C-, there exists a unique rate pair (i?i,i?2) such 
that pRi + i?2 = max(ri,r2)e3t'(/t^i + ^2)- Consider 

max Uri + r2) = max ihiPpo + P) + pPhipo) - jShiPo))- 
(i’i,r-2)e3t' / 3 ,po 6 [ 0 ,l] ^ 

We show that i( 3 ,po) that achieves the maximum is unique. Note that 

max {hiPpo +P)+ pPhipo) - Phipo)) = max (hip) + ^[phip) - hip)])- 

Since hip) is strictly concave, hip) + C[/r/i(p) — hip)] is strictly concave. Let p* the unique solution that maximizes 
hip) + ^[phip) - hip)]- Then 

max (hiPpo+P) + pPhipo) - Phipo)) = hip*) + €.[phip*) - hip*)] 

P,PO 

= max {hip*) + pPhipo) - PhiPo)) 

/S,po:/Spo+/ 3 =p* 

Finally by Lemma 1 , there exists a unique iP,po) such that Ppo P = p* and 


€[phip*) - hip*)] = pPhiPo) - Phipo). 


Thus 3^' = co{3i'}. To complete the proof for 3i' = 3^", we use Lemma 2 below. 


Lemma 2. [24] Let 31 S be convex and 3?' C 31" be two bounded convex subsets of 31, closed relative to 31. If every 
supporting hyperplane of 3?" intersects 3?', then 3i' = 3J". 

As mentioned in the first part of this section, superposition coding is also optimal for the essentially less noisy and essentially 
more capable classes. Can we extend the range of parameter a for which superposition coding is optimal by evaluating the 
conditions for these two classes? 

To answer this question, first note that or binary input broadcast channels, the essentially more capable condition in [22] 
reduces to that for the more capable class; hence essentially more capable does not extend the range of a for which superposition 
coding is optimal beyond more capable. To see this, consider a binary input broadcast channel which is not more capable. 
Then there exists p,q G (0,1) such that IiX;Y 2 )p — IiX;Yi)p > 0 and IiX;Yi)q — IiX;Y 2 )q > 0. Then 

€[IiX; Y 2 )r - HX; Yi)r] > 0 for every r G (0,1), 

€[IiX;Yi)r - IiX;Y 2 )r] > 0 for every r e (0,1). 
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Thus, for every X ~ Bern(r) for r G (0,1), there exists Ux and U 2 such that I{X] T 2 |C^i)—^i|C^i) > 0 and I{X-,Yi\U 2 ) — 
I{X]Y 2 \U 2 ) > 0. Hence if a binary input broadcast channel is not more capable it is also not essentially more capable. 

The answer for essentially less noisy is less clear. It appears to be quite difficult to evaluate the set of pmfs Vo that satisfy 
the condition in (21). In the following section, we define a new class of broadcast channels for which the condition can be 
easily evaluated and which includes the essentially less noisy class. 


IV. Effectively less noisy P-BC 

Consider outer bound CRi on the capacity region of the DM-BC p(j/i, y 2 |a;) which consists of all rate pairs such 

that 


R 2 <I{U-,Y 2 ), 

Ri + R 2 < HU; Y 2 ) + I{X; Vi I U) 


(18) 


for some p{u, x). To see that this is indeed an outer bound, note that it is simply the Korner-Marton [16] outer bound without 
the sum rate bound. 

The outer bound can be alternatively represented in terms of its supporting hyperplanes as: 

^ , /maXp(„,,)(A/(X;ri|C/)+J([/;r2)) if 0 < A < 1, 

(^iT2)G3ti \maXp(„,,)A(/(C/;y2) + /(X;yi|[/)) if A > 1. 

Now consider the supporting hyperplane representation of the superposition rate region in (2): 

/maXp(„,,)(Amin{J(X;ri|[/),J(X;ri)-/(C/;y2)} + /(C/;r2)) if 0 < A < 1, 
|maxp(„_^) A(/([/;ri)+/(X;Yi|[/)) if A > 1. 

Note that and differ only in the first term when 0 < A < 1: for IRi, the term is \I{X;Yi\U), while for the term is 
A min{/(Ar; Fi|(7), /(2f; Fi) — /([/; ^ 2 )}- When A > 1, for 3?!, the term is A/(C/; ^ 2 ), while for the term is XI(U;Yi). 

Now it is easy to see that if the DM-BC is less noisy, i.e., I{U;Yi) > I{U;Y 2 ) for all p{u,x), then these two bounds 
coincide. The key observation that leads to a more general class than less noisy is that the inequality I{U;Yi) > I{U;Y 2 ) 
does not need to hold for every p{u, x). For example, if I{U; Yi) > I{U ; Y 2 ) for every p{x) € V and every p{u\x) such that 


max max (A/( 2 f; Yi |[/)+ /([/; F2)) = max max (A/(2f; Fi |[/) + /([/; F2)) 

p{x) p{u\x) p^x)^!^ p{u\x) 


( 20 ) 


for every 0 < A < 1, then and ;ki coincide. 

This particular example is quite interesting because both the condition for the set V and the inequality I(U;Yi) > UU;Y 2 ) 
can be expressed in terms of the upper concave envelope of /(Jf;Fi) — I{X;Y 2 ), which makes their evaluation quite 
straightforward especially for binary broadcast channels. 

We are now ready to introduce a new class of broadcast channels for which superposition coding is optimal. 


Definition 8 (Effectively less noisy broadcast channels). For a DM-BC, p{yi,y 2 \x), let V be the set of pmfs p{x) such that 
for every 0 < A < 1, 

meii^{HX;Y2) + €[XIiX;Yx)-HX;Y2)]) = max {l{X;Y2) + H^HX;Yx) - I{X;Y2)]). (21) 

p{x) p{x)gV ' ^ 

Receiver Yi is said to be effectively less noisy than receiver F 2 if I{X;Yi) — I{X;Y 2 ) = €\HX;Yi) — I{X;Y 2 )] for every 
p{x) € 7. 


Clearly if the DM-BC is less noisy, then it is effectively less noisy. We can further show that if the channel is essentially 
less noisy as defined in [22], it is also effectively less noisy. To show this note that the sufficient class Vo in (3) must satisfy 


max max (A/(X; Yi |[/)-I-/((7; F 2 )) < max max (A/(2f; Yi |(7)-f/([/; F 2 )) 

p{x) p{x)^Vo p{u\x) 

for every 0 < A < 1. Hence Vo ^ V. If /([/; Yi) < /([/; Y 2 ) for p{x) € Vo and every p{u\x), then /([/; Yi) < /([/; Y 2 ) 
for p{x) G V and every p{u\x). We do not know if the condition for effectively less noisy is strictly weaker than that for 
essentially less noisy, however. As we will see in the next section, effectively less noisy neither implies nor is implied by more 
capable in general. 

The definition of effectively less noisy can be readily extended to the P-BC in the same manner as the less noisy and more 
capable we presented in the previous section. 

Definition 9 (Effectively less noisy P-BC). For the 2-receiver P-BC, let Q C [0,1] be such that for every 0 < A < 1, 


max hiq) + e:[A/i(g) - /2(g)] = max 12(9) + e:[A/i(g) - hiq)]- 
ge[o,i] 


( 22 ) 
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Receiver Yi is said to be effectively less noisy than I 2 if hio) — hiq) = ^^[-^ 1 ( 9 ) “ ^^ 2 ( 9 )] for every q € Q. 

To establish the parameter ranges for effectively less noisy P-BC, we need the following additional breakpoints of a: 

ai2 = 9i{q2), 

0:23 = 92{qi) 

where the functions (/i(-) and p 2 (-) are defined in (7) and (9), respectively. 

Theorem 2. Consider a 2-receiver P-BC and assume that si < S 2 - 

1. If a > ai 2 , Ti is effectively less noisy than Y 2 , and the capacity region is the set of rate pairs that satisfy (10). 

2. If a < 023 , ^2 is effectively less noisy than Yi, and the capacity region is the set of rate pairs that satisfy (12). 

Note that as for the less noisy case, the capacity region for effectively less noisy channels is also attained using binary U 
and a Z-channel from U to X. Figure 8 illustrates the parameter ranges for which superposition coding is optimal. 


eff. less noisy r 



less noisy 

more capable 

1 

less noisy 

1 



degraded i- 

1 




1 ^ 


a4 as a23 0-2 otvi 1 

Fig. 8: Illustration of the ranges of a for which the channel is degraded, less noisy, more capable, and effectively less noisy as stated in Theorems 1 and 2. 


It can be shown that q ;4 < eta < 023 < ct 2 < ai 2 < (see Appendix B). The fact that < q;i 2 implies that if Yi is 
effectively less noisy than y 2 , it is also more capable than y 2 (see Figure 4-(b) for an example). Thus effectively less noisy does 
not offer any new range of parameters for which superposition coding is optimal. On the other hand, the fact that < 023 
implies that for a G (eta, etas], ^2 is effectively less noisy but is not more capable than Yi (see Figure 5-(b) for an example). 
Thus effectively less noisy offers a new range of parameters for which superposition coding is optimal. 

Proof of Theorem 2 : 

1. We first show that Q = [0, 92 ] satisfies (22). Specifically we show that l 2 {q) + C[A/i((?) — l 2 {q)] is decreasing in [ 52 ,1]- 
Since l 2 {q) + C[A/i(g) — l 2 {q)] is concave, it suffices to show that the derivative of l 2 {q) + C[A/i(( 7 ) — hiq)] at g = <72 
is nonpositive. Consider 


djhiq) + £[A/i(g) - /2(g)]) (a) f XI[{q2) if 92 > t' 

dq \l2i92) - (A/i(f') - I2(t'))/{1 -1 ') otherwise. 

(b) 

< 0 

for t' = t{a') where a' = Aa. Step (a) follows by rewritting A/i(<7) for Ai = a as Ii{q) for Ai = \a (see (4)) and 
then applying Lemma 1. Step ( 6 ) holds because \I[{q2) < 0 and —(AJi(f') — l2{t'))/{\ — t') < 0 . The first inequality 
holds since si < S2 implies qi < 92 and XI[{qi) = 0 . The second inequality holds by ( 30 ). 

By Lemma 1 , €[Ii{q) — hiq)] = Iiiq) — l2{q) for q G [0,(72] if and only if o; > 51 ((72) = 012 (also see Figure 3 (a)). 
2 . We first show that Q = [(71, 1 ] satisfies ( 22 ) with /i(-) and l2{-) interchanged. Specifically we show that Ii{q) + 
€[Xl2{q) — Ii{q)] is increasing in [ 0 , gij. Since Ii{q) + £[A/2((7) — Ii{q)] is concave, it suffices to show that the 
derivative of Ii{q) -f (i[Xl2{q) — fi((?)] at (7 = (71 is nonnegative. Consider 


d{hiq) + €[Xl2iq) - hiq)]) 


dq 


d{h{q) + X€[l 2 {q)-Iiiq)/X]) 




i^ixi 2 iqi) ifqi>r' 

-I- (A/ 2 (r') - Ii{r'))/r' otherwise. 

(b) 

> 0 


for r' = r{a') where a' = a/X. Step (a) follows by rewritting Ii{q)/X for Ai = a as Iiio) for Ai = a/X (see (4)) and 
then applying Lemma 1. Step (6) holds because Xl^iqi) > 0 and {Xl 2 {r') — /i(r'))/r' > 0. The first inequality holds 
since si < S 2 implies qi < 92 and Xl 2 {q 2 ) = 0. The second inequality holds by (30). 

By Lemma 1, €[ 12 ( 9 ) — fi(7)] = fa(< 7 ) — Ii{q) for q £ [gi, 1] if and only if a < 92 ( 91 ) = 023 (also see Figure 3 (b)). 
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Figure 9 illustrates the parameter ranges for which superposition coding is optimal. As can be seen the area in the a-S 2 
plane where superposition coding is not optimal becomes smaller as si increases. In Appendix C, we show that the fraction 
of the channel parameter space for which superposition coding is optimal approaches one; hence superposition coding is in a 
sense almost always optimal for the P-BC. In comparison, the fraction of the parameter space for which the P-BC is degraded 
is always bounded away from 1. 



Fig. 9: Plots of a versus 52 for 51 = 0 (top left), 5i =0.1 (top right), 5i = 1 (bottom left) and si = 3 (bottom right). The shaded areas in each plot are 
where superposition coding is optimal and the light shaded area (a > 1) is where Y 2 is a degraded version of Yi. 


As illustrated in Figure 8, the capacity region of the P-BC is achieved using superposition coding in the ranges a < a 23 
and a > a 2 - Note that for the remaining range of a the channel is not more capable or effectively less noisy, which follows 
immediately from the if and only if conditions established in Theorems 1 and 2. In the following section we explore bounds 
on the capacity region in the range for a € (q; 23 , 0 : 2 )■ 


V. Gap between Marton and UV 


The best known inner and outer bounds on the capacity region of the DM-BC are the Marton inner bound [16] and the UV 
outer bound [17], respectively. We show that for a € (a 23 ,a 2 ) there can be a gap between these two bounds. 

In [25], Geng, Jog, Nair and Wang showed that for binary input broadcast channels, Marton’s inner reduces to the set of 
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rate pairs such that 


k 

i?i </(ly; yi) + ^/3,/(X; Fi I= j), 

5 

R2<I{W-Y2)+ Y. P,I{X-Y2\W = j), (23) 

k 5 

<min{/(iy;yi),/(iy;r2)} + ^/3,/(X;yi|M/= j)+ ^ /3,/(y;y2|iy = j) 

i=i i=fe+i 

for some pw{j) = /3j, j G [1 : 5], and p{x\w). This region is achieved using randomized time-division [26]. This ingenious 
insight helps simplify the computation of Marton’s inner bound for the binary P-BC; hence for the P-BC itself. 

The UV outer bound on the capacity region of the DM-BC is the set of rate pairs (i?i, R 2 ) such that 


Ri<IiV;Yi), 

R2<I{U;Y2), 

Ri+R2<IiV;Yi) + I{U;Y2\V), 
Ri+R2<I{V-,Yi\U)+I{U-,Y2) 


(24) 


for some p{u,v) and function x{u,v), Wl\V\ < IT”! + 1. Computing this bound even for binary input broadcast channels 
is quite difficult. Hence, we compute the maximum sum rates for the Marton and the UV bounds instead of the complete 
bounds. Figure 10 plots the maximum sum rates for q ;23 < a < a 2 when si = 0.1 and S 2 = 1 (q ;23 = 0.27, q ;2 = 0.4). Note 
that for 0.27 < a < 0.286, the sum rates coincide. For the rest of the range there is a small gap between the Marton and the 
UV bound sum rates. In particular for a = 0.34, the gap is approximately 0.0039. 



Fig. 10: Plots of the maximum Marton sum rate (dot), superposition sum rate (dash), and UV sum rate for si = 0.1, S 2 = 1, and a € [a 23 , 0 : 2 ]- 

It turns out that superposition coding is optimal in the range where Marton’s sum rate and the UV outer bound sum rate in 
Figure 10 coincide, i.e., for a € [0.27,0.286]. To show this, note that the condition for effectively less noisy in (20) can be 
tightened further. It is clearly sufficient that I(U;Yi) > I{U; Y2) for every p*{u, x) G CP(U, X) and every 0 < A < 1 such that 

max (A/(y;Yi I (/) + /([/; ^2)) = max {XI{X;Yi\U) + I{U;Y2)). ( 25 ) 

p(u,x) p* (u,x)g0^(17,X) 

This is the condition satisfied for the P-BC in the range a G [0.27,0.286]. However, unlike the looser condition (20), where 
we are able to express the class of pmfs in terms of the concave envelope of I(X;Yi) — /(AT; I 2 ) and determine analytically 
the range in which the P-BC is effectively less noisy, we can only numerically evaluate the above condition. 

VI. Average power constraint 

The results on superposition coding in sections II- IV can be readily extended to the case when there is also an average 
power constraint. Suppose that in addition to the maximum power constraint X(t) < 1 (which is needed for the capacity to 
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be finite) there is an average power constraint , i.e., 

T 

X{t)dt < a. (26) 

The capacity region of P-BC for the average power constraint setting is equal to the capacity region of the corresponding 
binary P-BC for E[2f] < a. This is a simple extension for Wyner’s argument that the capacity of point-to-point Poisson channel 
is the equal to the capacity of the corresponding binary channel for E[26] < a [6, 7]. 

Note that if a broadcast channel is less noisy or more capable, the UV outer bound in (24) with an input constraint coincides 
with the superposition inner bound with the same input constraint. Hence, if a P-BC is less noisy, it is also less noisy under the 
average power constraint, and similarly if a P-BC is more capable, it is also more capable under the average power constraint. 

In contrast, if a P-BC is effectively less noisy, it is not necessarily effectively less noisy under the average power constraint. 
The results of effectively less noisy, however, can be easily extended to the average power constraint setting. 

Theorem 3. For P-BC with an average power constraint in (26), 

1. Receiver Yi is effectively less noisy than T 2 if ct > 3i(niin{(T, 52 }), 

2. Receiver Y 2 is effectively less noisy than Ti if a < p 2 (niin{cr, gi}) 
where the functions 5 i(-) and g 2 i-) are defined in (7) and (9), respectively. 

Proof: 

Note that X ~ Bern(g) for q G [0, cr] satisfies the average power constraint. The condition in Definition 9 can be modified 
to the average power constraint setting. Let Q C [0, cr] be such that for every 0 < A < 1, 

max l 2 {q) + €[XIi{q) - / 2 (g)] = max 12 ( 9 ) -b €[\Ii{q) - hiq)]- (27) 

i?e[o.<T] qeQ 

Receiver Yi is effectively less noisy than Y 2 under the average power constraint if Ii{q) — hiq) — ^[h{q) — -^ 2 ( 9 )] for every 

q&Q- 

1. Recall that for P-BC without an average power constraint, Yi is effectively less noisy than Y 2 if 

hiq) - h{q) = ^[hiq) - hiq)] for g G [ 0 ,( 72 ]. 

For P-BC with an average power constraint q < a, Yi is effectively less noisy than Y 2 if 

h{q) - hiq) = ^[h{q) - h{q)] for q G [0,minja,( 72 }]. 

If a > ( 7 i(min{CT, ( 72 }), Yi is effectively less noisy than Y 2 under the average power constraint. 

2. Recall that for P-BC without an average power constraint, y 2 is effectively less noisy than Yi if 

hiq) - hiq) = ^[hiq) - hiq)] for g G [gi, 1]. 

If there is an average power constraint q < a for some cr > gi, then Y 2 is effectively less noisy than Yi under the 
average power constraint if 

hiq) - hiq) = £[ 72 (g) - 7i(g)] for g G [gi,CT]. 

Thus for a < ( 72 ( 91 ), ^2 is effectively less noisy than Yi under the average power constraint. 

If there is an average power constraint q < a for some cr < gi, then y 2 is effectively less noisy under the average power 
constraint than Yi if 

hiq) - hiq) = ^[hiq) - hiq)] for g = cr. 

Thus for a < (72 (< 7 ), Y 2 is effectively less noisy under the average power constraint than Yi. 



VII. Final remarks 

We showed that superposition coding is optimal for almost all Poisson broadcast channels, and that when superposition is 
not optimal, there is a gap between Marton’s inner bound and the UV outer bound. Hence the capacity region for the P-BC 
is still not known in general. 

We note that in [27], Geng, Nair, Shamai, and Wang similarly showed that for the class of binary input symmetric output 
broadcast channels (which do not include the binary P-BC) either superposition coding is optimal or there is a gap between 
Marton’s inner bound and the UV outer bound. 

We introduced the effectively less noisy broadcast channels for which superposition coding is optimal. This condition for 
effectively less noisy can be further extended, but we can only verify it numerically. 

Why is this the case and does it hold for general binary input broadcast channels? 
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The intuitive reason superposition coding is almost always optimal for the P-BC is that the binary P-BC is more capable 
(and even less noisy) for most parameter ranges as established in Appendix C. Hence one channel is almost always stronger 
than the other. 

For other classes of binary input broadcast channels, the more capable condition is much less likely to be satisfied. As an 
extreme case, consider the skewed binary broadcast channel in Figure 11-(a), which is a generalization of the skew symmetric 
BC in [26]. We can show that the channel is not more capable (hence also not less noisy or degraded) for every pi,p 2 G (0; !)• 
It turns out, however, that for {pi,P 2 ) in the dark shaded area in Figure 11-(b), the channel is effectively less noisy, and for 
ipi,P 2 ) in the lightly shaded area in Figure 11-(b), the BC is not effectively less noisy but superposition coding is still optimal 
(which is shown by verifying the stronger condition in (25)). These shaded areas constitute 76% of the parameter space area! 
This clearly demonstrates that our intuition about when superposition coding is optimal does not always hold. The unshaded 
area in Figure ll-(b) is where Marton’s sum rate is strictly greater than the superposition sum rate (i.e., max{Ci,C 2 }), which 
implies that superposition coding is not optimal. 



Pi 

(a) (b) 

Fig. 11: (a) skewed binary broadcast channel (b) Plot of pi vs p 2 . The lightly shaded area is where superposition coding is optimal and the dark shaded area 
is where one receiver is effectively less noisy than the other receiver. 


To perform the analytical and computational evaluations, we relied heavily on the concave envelope method that has been 
used in other applications, including [28] which establishes the optimality of dirty paper coding and superposition coding for 
MIMO broadcast channels with common message, and [29] which shows that UV outer bound is not tight. 

Finally, it would be interesting to find a similar extension of the more capable to the notion of effectively less noisy. The 
difficulty is finding an outer bound similar to the one we used for effectively less noisy. 
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Appendix A 
Proof of Lemma 1 

We first prove that the function {l 2 iq) — Ii{q)) can change concavity at most once in the range g G [0, Ij. In particular, the 
function l 2 {q) — Ii{q) is (i) concave in [ 0 , 1 ] if a < 0 : 4 , (ii) convex in [ 0 , 1 ] if a > ai, or (hi) convex in [ 0 , k] and concave 
in [k, 1 ] otherwise, where 


K = 


CtS2 — Si 

1 — a 


€[ 0 , 1 ]. 


To show the above statement, note that the second derivative of l 2 {q) — Ii{q), 


i2iq)-iiiq) 


(a — l)g + CXS 2 — Si 

(<7 + si)(g + S2) 


has a unique zero, is nonnegative if a > ai, and is nonpositive if a < 0 : 4 . Now we are ready to prove Lemma 1. 
1. Let fi(q) = (Ii(q) — l 2 (q))/(I — q). We first show that t defined in ( 6 ) satisfies 


(28) 


(29) 


t = arg max fi{q). 


( 30 ) 
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It can be shown that 

/[{q) = (1 - - gi(g)) f (1 + Si) log ^ ^ - l + q\, 

V g + Si J 

where gi{q) is defined in (7). It can be easily checked that f[{q) and {a — gi{q)) have the same sign for 0 < g < 1 and 
gi{q) is increasing for q> Q. Hence 

a) if a < gi(0) = as, then f[{q) < 0 in [0,1] and fi{q) is maximized at t. 

b) if a > limq_ 5 .i gi{q) = ai, then f[{q) > 0 in [0,1] and fi{q) is maximized at t. 

c) otherwise, f[{q) > 0 in [0, ( 7 ||"^(a)] and f'{q) < 0 in [g^^(a), 1] and fi{q) is maximized at t. 

Let 


L ( ) = if 0 < g < f, 

if q>t. 

Then Li{q) > Ii{q) - hiq) because fi{t) > fi{q) for any q G [0,1]. 

On the other hand, note that for q > t, Li{q) can be expressed as 

L,iq) = ((1 - g)/(l - t))ihit) - h{t)) + {{q - f)/(l - f))(/i(l) - h{l)). 

Since g = ((1 — q)/{l — t)) ■ t + ((g — t)/{l — t)) ■ 1, it follows that for q > t, Li{q) is a convex combination of 
hit) - hit) and /i(l) - hih- Thus €[hiq) - hiq)] > Ti(g)- 

Finally to argue that Li(q) = €[Iiiq) — 72(g)], we are left to show that Li(g) is indeed concave. We first show that 
Liiq) is concave in [0, t]. Recall that Iiiq) — hiq) is concave in [0, k] for n in (28). Thus it suffices to show that t < k. 
Equivalently we show that /{(k) < 0. Consider 


hiq)-hiq) ihil)-hil))-ihiq)-hiq)) 

1 -q (1 - g)^ 

= ihiq) - hiq))/a - g) - ihiu) - hiu))/ii - g) 


for some q < u < 1. The inequality /{(k) < 0 holds since /f (g) — It^iq) > 0 for q > k as can be seen from (29). We 
conclude that Liiq) is concave because it is concave in [0,f] and [t, 1] and is differentiable at t. 

2. Let / 2 (g) = ihiq) — hiq))/q- We show that r defined in (8) satisfies 


It can be shown that 


r = arg max /2(g)- 

9e[o.i] 


(31) 


hiq) = g - g2(g))(g - Si log(l + q/si)), 

where (72(g) is defined in (9). It can be easily seen that / 2 (g) and (a — (72(g)) have the same sign for 0 < q < 1. Also 
it can be shown that (72(g) is increasing for q > 0. Hence 

a) if q; < lim(j_ 5 .o ( 72 (g) = < 24 , then hiq) < 0 in [0,1] and / 2 (g) is maximized at r. 

b) if q; > ( 72 ( 1 ) = 012 , then / 2 (g) > 0 in [ 0 , 1 ] and / 2 (g) is maximized at r. 

c) otherwise, / 2 (g) > 0 in [0, ( 7 ^^(a)] and / 2 (g) < 0 in [gt^^^ia), 1] and / 2 (g) is maximized at r. 

Let 

L (a) = [ ^^/hihir) - hh)) if 0 < q < r, 

^ \72(g)-7i(g) ifg>r. 

Then hiq) - hiq) < L 2 iq) because / 2 (r) > hiq) for any q G [0,1]. 

On the other hand, €\hiq) — 7i((7)] > L 2 iq) because for 0 < q < r, L 2 (g) is a convex combination of hh) — hh) 
and 12 ( 0 ) — 7i(0). To complete the proof we now show that ^ 2 ( 7 ) is concave. Recall that hiq) — hiq) is concave in 
[k, 1] for K in (28). To show that L 2 iq) is concave in [r, 1], we show k < r, i.e. fhh) > 0. Consider 

h{q) = hiq)-hiq) _ hiq)-hiq) 

= ihiq)-hiq))/q-ihiu)-hiu))/q 

for some u < q. Since I^iq) —/"((?) > 0 in [0, k], it follows that fhh > 0. We conclude that L 2 (g) is concave because 
it is concave in [0, r] and [r, 1] and is differentiable at r. 
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Appendix B 
Order oe breakpoints 

Since less noisy channels are also more capable it follows that a 4 < as and a 2 < ai. Also Yi and I 2 cannot be more 
capable simultaneously unless the two channels are identical. Thus, 0 < q ;4 < as < q ;2 < ai. To completely characterize the 
order, we need to show the following: 

(a) (6) (c) (d) 

0^2 ^ o^l 2 ^ ai and as ^ a 2 s ^ CI 2 . 

Note that inequalities (6) and (d) follow by Lemma 1. The remaining inequalities were shown through simulation. To show 
(a), we checked using Mathematica that the minimum of ai 2 — a 2 for 0 < si < S 2 is nonnegative. To show (c), let 

a:log(l + l/x) - 1 

'iv(x) ^ _ 

(1 + x) log(l + cc) - (1 + x) log((72 + x) - 1 + <72 ’ 

Then the condition as < a2s is equivalent to w{x) < w{s2) for all x < S2 for any <72, and we checked that this is true using 
Mathematica and Maple. 


Appendix C 

VOLUME 

Consider the set of channel parameters: {a, si, S 2 ) € [0,1] x [0, b] x [0, kb] for b,k > 0. Let bo = min{5, kb}. The fraction 
of the set of P-BC parameters for which the channel is degraded is 


-^11 dS2 dsi = 


Si>S2 


0.5k-^ if fc > 1 , 
l-0.5fc iffc<L 


The fraction of the set of P-BC parameters for which superposition coding is optimal is lower bounded by the fraction for 
which the P-BC is less noisy. Consider 

/ dads2dsi 

Si <S2 j Sl/S2 

7 ^(1+si )/(l-t-s2) 


1 - 


1 


= 1 - 


= 1 - 


1 


da ds 2 dsi 


'0 d Si dsi/s2 
■'bo pkb 


hri 


= 1 - 


1 f{l + bo)‘^ — l fi , I ^ 0^1 ; r 1 + ^ 0 ^, , r \ 1 ^ 0 ^ log 60 , K 

- - -log(l + kb)-— log kb - - — log(l -f bo) -f- - - + Y 


kb"^ 

1 


-'-wrdrbo. 


1 + kb 1 + boi log(l-f6o) 


kb 


--log 


+ feo( log(l + kb) - log(l + fco) + ^)^ , 


which approaches to 1 as 6 -5- cx). 
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